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ABSTRACT 


An approximation formula is derived which provides a simplified 
and efficient method of calculating the Oth percentile point of a 
two parameter beta function when the two parameters are known. A 
specific application is presented by utilizing the formula to com- 
pute the lower 100(1-@2)% Bayesian confidence limit for the reli- 
ability of a component when the prior distribution of that 
reliability is known to be beta with parameters a' and b'. The 
posterior distribution is then determined by mission testing n 
items and recording the number of successes, s. This distribution 
is known to be beta with parameters a and b, where a = a' + § ane! 
b= b! +n - s. Therefore, the formula can be utilized to deter- 
mine the @th percentile point of this posterior distribution which 
by definition is the lower 100(1-@)% confidence limit for the 


Welaabilavy of the component. 
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I. INTRODUCTION 


The object of this paper was to determine a method of approxi-~ 
mating che Mth percentile point for the reliability of a component 
when this reliability is known to have a beta distribution. More 
specifically, the prime interest was in approximating percentile 
points of highly skewed, two parameter beta distributions. The 
approximation formula that was derived provides a simplified 
method for determining the &th percentile point of a beta distri- 
bution when the two parameters a and b are known. 

Given that the reliability R of a component is distributed 
beta with parameters a‘' and b’ the posterior distribution of R can 


then be determined. 


Let n = the number to be sampled 
Let s = the number of successes 
beter = tne number of failures 


The n items are then mission tested. By applying Bayes Theoren, 
the posterior distribution of R is found to be beta with para- 
Peters (a' +s) and (b' + f). 

Petes, —- a + Sand b= b' + £f 
Therefore, the posterior distribution of R is beta with parameters 
a and b. Once the values of a and b have been determined, the 
formula below can be utilized to approximate Py» the @th percen- 


tile point of the reliability of the component. 


Pp S Sin 21 + Arcsin ere + phe eee 
. 2\a+b-1 Ee 3(atb=1) 





Where Z aS a nwmber such that p|z a. 


1-o = 1 -& and Z is 


1-0] 


distributed normal (0,1). Therefore, ~ can be looked up in 


ate 
any Standaid Normal Tables. 





II. DISCUSSION 


A. BAYESIAN RELIABILITY 
Given that the PELOL Gensity, h,(r3;a',b'), is known and that 
the conditional density, Io} p(slr)> has been determined by testing, 


the posterior density, (fea. can be computed by use of Bayes 


tris 


Theorem: 


9 (si) belie oa' sb) 
fetes) a eins ios 
te Io} p(slr) h,(r5a',b')dr 


Once the posterior density has been determined the 100(1-@)% 


Lower Bayesian Confidence Limit, is defined by, 


ey 
PR 1a)” e = 1-0 


“A 


where R is the reliability of the comoonent. Therefore, Ri (a) is 


the @th percentile point of the posterior distribution of R. 


B. APPROXIMATING A BINOMIAL DISTRIBUTION WITH A BETA DISTRIBUTION 
The following identity relates the binomial distribution with 
the beta distribution. If U is distributed beta with parameters 


s and (n - s + 1), where n and s are positive integers and s Sn, 


nN nN 5 e 
eae. (S)pJa-py?3, for 0 = pl. 


iH 


Therefore, P(U = p) P(X 2 s) where X is distributed binomial 


with parameters n and p. 





C. THE POSTERIOR DISTRIBUTION 
Given that the prior density, hp(r3a',b") is beta with para- 


mewers a' and b* then 


ite eb ) a't-l be=1 
e t t = — 4 = < 
hu (r5a gb) T(a'yl(b') x (ler) 3 Lona be I. 
The conditional density, Gop p(slx); is binomial with parameters 


n and xr where 


n = the number of items to be mission tested 
r = the probability of success 
s = the number of successes observed. 


Therefore, the conditional density is 


n 


Io} p( sit) Meas). 


The posterior density, (x|s) 1S given by 


tels 


| penetra ',b*) 

Ses p(s R ? bd 

fai g(tls) e 
o 9 


sfR(sl*) h,(r5a',b")dr 


n T(a't+b! t t 
=e 1{ ee a) bt-1 
OLeagen P(aty(pt te (l-r) 
“ifn 
cq Gait. } th te 
( (2)x5(a-2)” s a ge di 
O 


P(a')P(b') o 


a't+s-l] b'+n-sel 


ia (1-r) 


UY foes tanuce 
x? s eal +n-s-1 dr 


O 
a'+s-1 b'+n-s-l]1 


ey (1-r) 
(at+s)l(b'+n-s) 


ia +b +n) 





ewer chin) a't+s-] b'+n-s-1 
= x (Lax) 
C(atts)P(b'+n-s) 
Therefore, the posterior density is beta with parameters (a' + s) 
Beem t T= Sj). het a =a" +s andb=b' +n-=+s. Therefore, 


the posterior distribution of R is beta with parameters a and b. 





III. THE APPROXIMATION FORMULA 


A. DERIVATION 


If X is binomial with parameters ri and p, then Y = 2 Arcsin 2 
in radians is approximately normally distributed with parameters 


ub and a”, where &@ = 2 Arcsin {p and hi =,* 








| il 
2rKcescan. ¥ yi=2n 


an 
25 ~ 
E | i Py (x) ax 
: \ i : 
2 Arcsin Vet ee 2 Axcsin| p 
} en 


P L(y) ~ = 1 


yn 





Where - Toya COrreeriom for continuity analogous to that used 


with the normal approximation. 


If U is distributed beta with parameters s and (n - s + 1), 


then 
n n : ie 
pup | = > (5) p? (1-p) EASE a | 
Js 
xX Slee 
and p [X22] =r tose) 


where X is binomial with parameters n and p. 


x2 s| 
n n 


PL2 aresin| aS 2 Arcsin | =] 
n n 
= . Ss : 
PLz (2 aresin (= 2 Arcsin \P) |n J 


plu < pl P 





"Brownlee, K.A., Statistical Theory and Methodology in 
Science and Engineering, p. 115-116, Wiley, 1960 
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Where p = E es and Z is distributed normal (0,1). Now use = as 


amecomtinuity correction factor. 


plu s p] = plz 2(2 Arcsin tea = - 2 arcsin{p) Yn ] 
= PLZ $(2 Arcsin LP 2 Arcsin [e423 aS ) yn] 
If U is beta with parameters a and b, where a = s andb=n-stl. 
b=enesFI1 
b= nt =a oF OL 
nh=a+tb- tl 
ES a er 
n 2n 2n Z(a + b= 1) a +b -1 


therefore, 


al 
< < = en a ke {z 2 
Plu pl ee (2 Arcsin | p 2 Arcsin a | atb-1 


and 


ei ee | 
mee ne ast 


Therefore, 2 Arcsin U is approximately normally distributed 


N{ 2 Arcsin Se : i ri 


ineeayecitan agalysis we want p such that PlU > p] = 1-e, 
OS 0a AN 


Then p is the lower 100(1-@)% Bayesian Confidence Interval on U. 


ig ats 
= > i pA i Z = 
Plu > p] PLZ (2 Arcsin |p 2 Arcsin aa ) {a+b ee 
Therefore, 
2 1 \ 2 + , ats ‘ +b ii 
(2042csin ip - 2 Arcsin Es ) Ya+b-1 = 24 wy 


where 21 aw is a number that Dez 725 Be = 1-4 and can be looked up 





in any Standard Normal Table. 


afal 








+h 
; Arcsin fr 2 Arcsin \|= — fatb-1 = Z 


oe 





atb- 1 +02 
Zz =e 
; ~O : 3 

2 Arcsin |p = ey 2 Arcsin Hee 

age saa 
Zz 1 
- fal = re Loe 
{P = Sin — + Arcsin eat 

2 \ atb-1 


2 


{le 





° ° 21-0 e ats 
op Sh Pee Sins apts &. 
2Y atb-1 | 


For simplicity, the fraction % was dropped from the numerator of 
the second term. This had the effect of reducing the approximated 
value of p. To compensate for this reduction, a correction factor 


was added to the formula as shown below: 





Zz 


= |Sin et Arcsin =a peers Se 
P 2 [Tatb-1 \atb-1 2(atb-1) 


However, if b< 1.0 then > 1.0 and the Aresin is not defined 


wa 
atbe-1 
when the argument is greater than one. To prevent this problen, 
so that the approximation formula can be used for all values of 

b, the second term was modified by adding 1.0 to the denominator. 
After comparing the approximated values with the computed ones, 
the correction factor was also modified by multiplying the denomi- 
nator by 3 instead of 2. This resulted in reducing the approxi- 
mation to just below the computed value of the @th percentile 
points for low values of b, instead of just above it, as it was 


found to be originally. This change also caused the approximated 


values to be more uniformly accurate throughout the range of b. 


1s 





Therefore, the final approximation formula for the @th 


percentile points of the beta distribution is: 
12 


Pp Sin eo ea SLT: | aoe fs pee eee 
104 SS pe 
7 2 Vatb-1 [as5) 3(atb-1) 


B.  AGGURACY RESULTS 

The &tn percentile points generated by the approximation for- 
mula compare very favorably with the computed values which will 
hereafter be referred to as the "exact'! values. As shown in. 
Tables I, II and III, the approximation formula results in ex- 
tremely accurate values for the percentile points of the beta 
distribution, especially for values of b less than or equal to 
five. Examination of the results indicates that the approximated 
value is less than the exact value when b values are less than 
2.5 ana aveater When b values are greater than 2.5. Therefore, 
for values of b near 2.5, the approximation formula results in 
values which are very nearly the same as the exact values. The 
accuracy cf the approximation is also a function of the parameter 
ae As the value of a increases the approximation becomes even 
more accurate. After using the approximation formula to compute 
various percentile points over a wide range of a and b values, it 
was noted that the worst estimated value was less than three 
thousandths away from the exact value for values of a greater 
than 40. Therefore, the formula will produce values for the &th 
percentile points of two parameter beta distributions with an 


accuracy which should be completely sufficient for most applications. 


13) 





Comparison of Approximate and Exact 5th Percentile 


TABLE I. 


Points for the Beta Distribution 
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Comparison of Approximate and Exact 10th Percentile 


TABLE II. 


tbution 


Distr 
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Comparison of Approximate and Exact 20th Percentile 


TABLE III. 
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This is especially true for values of the 20th percentile points, 
Table III, for which the approximated values are exceptionally 


accurate. 


C. ALTERNATE APPROXIMATION FORMULAS 
If the approximation formula is used with the continuity 


correction factor that was originally derived, 





2 | 2 
joy lj Sake a + Arcsin = — a 
2 farb-1 \ a+b 2(atb-1) . 


the resulting percentile points will all be greater than the exact 
values. The accuracy of the approximated values is extremely good 
for bS 1.0, but as the value of b increases, the approximated 
values exceed the exact values by an ever increasing amount. AS 
a result, the approximated values become unacceptable when com- 
pared with the values generated by the formula using Reet as 
the continuity correction factor. See Table IV for a comparison 
of the values generated by the two formulas. 

When the continuity correction factor is decreased by using 
4(at+b-1) for the denominator, the appr oximated values will be 
less than the exact values. This trait could be useful if con- 


servative estimates were desired. The approximation formula 


- Eee 


ae 
, ; a al 
2 [atb-1 | eles A (atb-1) 





is most accurate for values of b2 4.0 due to the fact that, as 


the value of b increases, the approximated value of p increases 


7 





with respect to the exact value. However, the approximated 
percentile points for lower values of -b are not nearly as accurate 
as those given by the chosen formula. See Table IV for a com- 
parison of the values generated by the two formulas. 

The approximation formula can be simplified by not subtractirg 


1.0 from the denominators of the first term and the correction 





factor. The resulting formula, 
2 
Zz ——— 
F 1-0 Be af 
i 2Yatb Jab ae) 


Qives very accurate approximations of the percentile points and 
1S quite acceptable for most applications. This formula is most 
accurate for b* 2.5 and should be used whenever b is restricted 
to these values. The chosen formula is only superior in that it 
results in extremely accurate estimates for all values of b, not 
just very Low values. However, the formula presented above would 
actually be preferred whenever b is restricted to very low values. 
See Table IV for a comparison of the values generated by the two 


formulas. 
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TABLE IV. Comparison of Approximate and Exact 20th Percentile 
Points Generated by Selected Approximation Formulas 


for the Beta Distribution 








a b  [j/Exact F-1 F=2 F-3 
cee , eg 
50 25 | 98362 -98151 | .9848%7 | 97982 
SO) 11,0 ee 90719 }.97052 1) 96552 
5O 11.5 |j.95486 | .95429 | .95759 | .95264 
Be 12.0 |. 942h2 94225 | .94552 | .94061 
5012.5 § 930690 1293081 69340 5 41929020 
50 | 3.0 1}.91950 | .91987 | .92307 | .91826 
504 3.5 I 90876 -90933 | .91250 | .90774 
50 | Qose | 89840 -89915 | .90229 | .89758 
50 74.5 .88838 | .88929 | .89240 | .88773 
50 a 4.07867 | .S7971 |G be200 28757 
100} .5 j}/.99180 | .99068 | .99236 | .98984 
100 | 3-0 -98403 | .98337 | .98504 | .98254 
100 , 1.5 97711 |497670 | 297836 | 29755 
100 ! 2.0 1.97064 | .97040 | .97205 | .96958 
100 ee .96447 | .96436 | .96600 | .96354 
HOO+| 35071795551 1 2958500 29001 | 7o5759 
MOO 3.5 19527351 2952811 2 9544 3: | 295200 
100 | 4.0 |].94711 | .94725 | .94887 | .94644 
mOG 1.5 || 94160 | .O418i 1.0432 oie 
100 | 5.0 |}.93622 | .93648 | .93808 | .93568 
Boas Pp sin we + Arcsin =) 2 cio 








+b= 
Dalen oil 2(atb-1) 





ee 
4(atb-1) 


Zz 
seas wont) be - 





2 {atb- 1 


= z, y 
: ee a ll 
F-4 p= [si AT eS i 5) + 3(atb) 
zZ 


[asp 


Zz 
35D = eer ie + Avesan 
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IV. EXAMPLES 


A. EXAMPLE 1 
Find the 20th percentile point for a = 100.0 and b = 2.0. 
Assume a' = 51.0 and b' = 1.0. A random sample of 50 items 


was then mission tested and one failure was observed. 


n = 50.0 and s = 49.0 
alt= a etc oO + 749.0 =n100e0 


b= b' +n -s =1.0 + 50.0 - 49.0 = 2.0 


Q@Q@= , = a 
20 and Zn o Z 89 842 
Og eee 
: 1-& : a iL 
pan | ol ———————— aresin| —— a 
2 2 (atb-1 hie sonra) 
ie 
p = | Sin gree = Aresin | ae + : 
— Sn SQece = = > $) Sse == ss ee me ee ee 
J, ¢ a ’ 
720 | 2 loo+Del 100+2 3(100+2-1) 


[sin (-.04189 + 1.43031) 17 + .00330 


(.98341)° + .00330 = .96709 + .00330 = .97039 


The table value is .97064 and the difference is only -.00025. 


Bi; EXAMPLE 2 
Find the 25th percentile point for a = 50.0 and b = 2.5. 
Assume a' = 26.0 and bt = 1.5. A random sample of 25 items was 


then mission tested and one failure was observed. 


25.07and s 7-270 


3 
FF 


a=zat +s = 26.0 + 24.0 = 50.0 


PAG, 


4h 





b = b' $n-=)5 =35 +o oe0lee oes 


R 
tI 


(2) andes = Z 


Or iowa - 675 











Lz r 
: 1-& ine eee. 1 
oe = Spa Sa ssesin {3 ) te 
| . {atb-1 ae | Ss 
|2 
- .675 ; 50 ] 





[sin (-.04703 + 1.35088)]” + .00647 


I 


(.96456)7 + .00647 = .93038 + .00647 = .93685 


The table value is .93680 and the difference is only +.00005. 


Za 
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formula can be utilized to determine the &th percentile point of this 
posterior distribution which by definition is the lower 100(1-@)% confidence 
limit for the reliability of the component. 
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